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(54) Video browsing on the worid wide web 

(57) A system and method are provided for support- 
ing video browsing over a communication network such 
as the Internet/World Wide Web. A graphical user inter- 
face is provided through a client software tool such as a 
Web browser. A client/user selects a video data object 
stored at a remote server. A set of points (52) within the 
object are displayed at the client's graphical user inter- 
face display, as representations, preferably thumbnail 
images, of the points within the object. The user selects 
an interval (56) defined by the representations, prefera- 

Tliree Stooges Movie 



biy the interval between two temporally adjacent repre- 
sentations. Responsive to this selection, a new set of 
points, falling within the selected interval, are chosen, 
arxl representations of those points are generated and 
displayed. By doing so repeatedly, the user can easily 
browse through the video data object, and quickly and 
easily zero in on a desired portion of the video data 
object. 
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Description 

Rdd of the Inventinn 

ticulJJv^^^2 to *° °* ""^"^^^ ^y^*^"^ ^"^^ muwrnedia communications, and more par- 

ticularly relates to the fieWs of video storage and compression, and of interactive video playback and browsing. 

Glossary of Termg I Iccri 

be u^L "leanings are also implied by certain terms used here, the following glossary of some terms may 

Intemet ("the Net") The connection system that links computers worldwide in a network 

JQEnPl Transmission Control Protocol/Internet Protocol. A packet switching scheme the Internet uses to chop 

route, and reconstruct the data it handles, from e-mail to video 

World Wide Weh (www . "the Weh-) The internet s application that lets people seeking information on the Internet 

J^K^^^' *° ^Z^'^ ''^^'^ *° ^^^'^"^ °" highlighted words or phrases of interest. ^ 

Internet Web server supports clients and provides information 

Homepaqe : A multi-media table of contents that guides a Web user to stored information on the Internet 
^mi A machine (computer) which performs a task at the command of another machine ("client"). In the context 
0* the present jiventoon. a server s primary function is to facilitate distribution of stored information over the Web 
CIlfiDt A machine which provides commands to a server, and is serviced by the server. Typically, a client machine 
IS operated by an end user, and functions responsive to user commands y. - o lern macnine 

^^^^ A program running on a user-operated client computer. When a user "surfs" the Web using a 

t^°?rnd™:;:?u^Xt";h" 
S^u.^;s:\7c:Sm^^^^^^^ 

tbEed!^ A network addressing tool embedded in a user-understandable displayed and/or highlighted item, such 
as a word, phr^ icon or picture. A URL can be accessed by means of its corresponding Hv^rlink. When a ^r 
ZL^r^^Z ^^^'""^ *^9mtnea hvperlink through the user inteSos. the uXying ite^is tir^ 
retrieved to the client supporting a Web browser. « » uwn 

mdicates that the document or file designated by the URL contains hyperlinks defined accoreling to the HTTP 
HyperText Marhip Lanquanf^ (HTMI ) html is the language used by Web servers to create and connect docu- 

Bernstein et al.. 5.204.947, issued April 20. 1993; 
Bernstein et al.. 5.297.249. issued March 22. 1994; and 
Lewis. 5.355.472. issued October 1 1 . 1994; 

all of which are assigned to International Business Machines Corporation, and which are referenced herein. 
Background Art 



-w^Tl^®!!* *^^"°'°9'es of video data compression, storage, and interactive accessing have converged 

muW:;^;iSrm'S^r°"' *° P^^^^"* ^'^'^^^^^ -eek access foV^T^SS 

in^Ja fl^f °* communications technologies, particluarly exciting has been the recent prominence of the 

2^r.ir^^^ "'I'^r " """^^ ^^^^ the public i^^Snat^n as Z 

t?r4!iiSrw*." "'^ ""^"^ ^ metaphorical 

aSThl^^^ confederaton of many drfferent networks, public and private, big and small, whose human oper- 
tt,Sr^^<^^rSil'^n^i^°"T°'''"^^ '^^ ""^"^ represented by these networks relies on no S^le 

2reTvr^:2r;Ll — - satel^e links. f»,er-optic trunk lines, phone linL. 

To this point the WorW Wide Web (Web) provided by the Intemet has been used in industry predominately as a 
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means of communication, advertisement, and placement of orders. The World Wide Web facilitates user access to 
information resources by letting people jump from one server to another simply by selecting a highlighted word, picture 
or icon (a program object representation) about which they want more information, a manoeuvre known as a "hyper- 
link". In order to explore the WWW today, the user loads a special navigation program^ called a "Web browser" onto his 
5 computer 

There are a number of browsers presently in existence and in use. Common examples are NetScape. Mosaic and 
IBM's Web Explorer. Browsers allow a user of a client to access servers located throughout the world for information 
which is stored therein. The information is then provided to the cliOTt by the server by sending files or data packets to 
the requesting client from the server's storage resources. 
10 Part of the functionality of a browser is to provide image or video data. Web still image or video information can be 
provided, through a suitably designed Web page or interface, to a user on a client machine. Still images can also be 
used as Hypertext-type links, selectable by the user, for invoking other functions. For instance, a user may run a video 
clip by selecting a still image. 

However, video data objects are very large, or, to put it more precisely, the quantity of data per unit time in a real- 
15 time viewing of a video data object is large. As a consequence, access by a user to a desired video data object is sub- 
ject to data throughput constraints. The present stale of the art makes it impracticable to provide more than a few tens 
of seconds of real-time video over the Internet with a response time that will be satisfactory to a user. 

Therefore, multimedia and communication systems for providing users with access to video data cAjjects, for brows- 
ing, searching, etc., must grapple with the problem of providing video data in a manner which best utilizes the available 
20 throughput to provide video data in a form which is most useful to the user. 

With this design objective in mind, let us now consider the state of the art in the technologies of video data com- 
pression, storage, and interactive accessing. Recent work has been done to make video material nrrore available and 
usable over the Web. For instance, an article in the August 1995 issue of ADVANCED IMAGING, by Amy T. Incremona. 
titied "Automatically Transcribing and Condensing Video: New Technology is Born", describes a method for providing 
25 video having an accompanying textual index, such as audio narration or closed caption text. Still images are presented, 
along witii a transcription of audio text that accompanies the images (illustration on page 60). This information is pro- 
vided in HTML format. Thus, a user can take advantage of the temporal correspondence between video shots and nar- 
ration or closed caption text. To find a desired point in the video corresponding with a known point in the text, the user 
performs a key word search for the known point in the text. The result of tiiis key word search is that the desired pwint 
30 in the video is reached. 

Additionally in Shahraray et ah, "Automatic Generation of Pictorial Transcripts of Video Programs". SPIE Vol. 241 7. 
pp. 512-518. there is described an automatic authoring system for the generation of pictorial transcripts of video pro- 
grams which are accompanied by closed caption information. The system employs a table having a series of rows, each 
row containing a pointer to a location of an innage, and another pointer to tiie beginning of a text segment related to the 

35 image. A viewing window for a GUI display is shown in FIG. 4 of Shahraray et al., and reproduced herein in sinplif ied 
form as FIG. 1 of the present patent application. FIG. 1 sfiows a video image 2, a closed caption text subtitie area 4, 
and a basic user control area 6. The basic user comrols include a "Seek" slider 8. 

Accordingly, the state of the art allows for user access to video information t>ased on associated text. However, a 
more general method for accessing video, not provided by the prior art, would sever the tie between video images and 

40 accompanying audio narration or closed caption text. 

Disclosure of the Invention 

It is therefore an objective of tine present invention to provide a system and method for allowing convenient user 
45 access to a stored video object, for viewing and browsing, through a communication medium having a client-server 
architecture, such as the World Wide Web. 

It is a further objective of the present invention to provide such convenient user access to a stored video object with- 
out requiring tiiat access to the video object to be keyed with any text or other accompanying indicia, not actually part 
of tiie video itself. 

50 To achieve these and other objects, tiiere is provided, in accordance with the invention, a method for displaying, on 
a user terminal, video data object information pertaining to a stored video data object, the stored video object including 
a tenporal beginning point, a temporal end point, arxJ temporal intermediate points tiierebetween. 
The method conprises the following steps: 

First, a sequence of representations of points witiiin tfie video data object, preferably still images, is provided to tiie 
55 user tiirough a user interface. Each representation or still image that is provided corresponds with one of tiie temporal 
points in the video data object. In a typical environment, in which the user is an Internet/World Wide Web user, and the 
video data object is stored in a remote repository accessible through a server, the step of providing the representations 
includes performing suitable operations to identify points within the vkJeo data object, for which representations are to 
be provided. A preferred technique is to detect scene cuts. 
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Then, a user interface is provided, including means for allowing a user to select an interval between first and sec- 
ond ones of the temporal points. The user interface is preferably a graphical user interface (GUI), as is commonly made 
available through computer operating systems such as IBM's OS/2 and Microsoft's Windows operating systems. Also 
suitable user interface equipment, such as a video screen and a mouse, are preferably used. 

Finally, responsive to user selection of such an interval, a subsequence of representatior^ is provided, each repre- 
sentation of the subsequence corresponding with a respective tenporal point in the video data object, each of the 
respective temporal points falling between the first and second tenporal points selected as discussed above 

Therefore, by providing this capability, the invention allows a user to iteratively home in on the portion of a lengthy 
large video object that he/she is interested in. The invention advantageously eliminates the need to use text accompa- 
nying the video object as a crutch, as was done in the prior art discussed above. 

While the invention is primarily disclosed as a method, it will be understood by a person of ordinary skill in the art 
that an apparatus, such as a conventional data processor, including a CPU. memory, I/O. program storage, a connect- 
ing bus. and other appropriate components, could be programmed or otherwise designed to facilitate the practice of the 
method of the invention. Such a processor would include appropriate program means for executing the method of the 
invention. Also, an article of manufacture, such as a pre-recorded disk or other similar computer program product for 
use with a data processing system, could include a storage medium and program means recorded thereon for directing 
the data processing system to facilitate the practice of the method of the invention. 

Brief Description of the Drawing s 

which*^^ described, by way of example only, with reference to the accompanying drawings, in 

FIG. 1 is an illustration of a prior art graphical user interface for video browsing; 

FIG. 2 is a system block diagram of a typical distributed computer system which has internal and external networks 
including the Internet, to connect clients to Worid Wide Web servers and other servers, the client systems being 
capable of incorporating the invention; 

30 FIG. 3 is a high-level flowchart showing the method of the invention; 

FIG. 4 is a flowchart showing a more detailed implementation of step 22 of the flowchart of FIG. 3; 

FIG. 5 is a flowchart showing a still more detailed implementation of step 28 of the flowchart of FIG 4* 

35 ' ' 

FIG. 6 is an illustration of a graphical user interface according to the invention; 
FIG. 7 is an illustration of a graphical user interface according to the invention; and 
40 FIG. 8 is a flowchart showing a more detailed implementation of step 44 of the flowchart of FIG. 3. 
Description of tfie Preferred Embodiment 

FIG. 2 illustrates a typical communication and processing system, including nodes (computers, processors, etc ) 
45 and communication media therebetween, making up an information delivery solution of a typical combination of 
resources including clients, servers, information repositories, and communication links or networks Various nodes 
shown typically as 9. are also coupled to one another by various networks, shown generally as 7. which may include 
local area networks (LANs), wide area networks (WANs), etc. 

The discussion of FIG. 2 which follows will focus on certain individual components for clearer illustration and 
50 description of the novel and non-obvious features of the present invention. In the system of FIG. 2. a dient machine 1 0 
includes a connputer or other device (as discussed above), running a Web browser program. The program run by the 
client 10, which incorporates the invention, may be provided to a general purpose computer by means of a commercial 
computer program product, such as a pre-recorded floppy disk 11 or other suitable computer-readable recording 
medium. In such a computer program product, the novel and non-obvious features of the invention, implemented as 
program code, are pre-programmed onto the disk 1 1 for convenient sales and marketing, and for directing operation of 
the client machine 10 in accordance with the invention. 

The client 10 is coupled, through a communication network 12 such as the Internet to a Web sen/er 14. Physical 
access to the Internet is provided in conventional fashfon. The general command protocols, eto.. for exchanging mes- 
sages between the client 10 and the server 12, are also conventional. 
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The client machine 10 vnay preferabiy be an intelligent conputer system, such as an IBM PS/2 conrputer, an IBM 
ThinkPad laptop computer, or an IBM RISC System/6000 workstation. Communication coupling made to the network 
via a suitable communication interface software tool, such as IBM's OS/2 WARP Connect software product. (IBM. PS/2. 
ThinkPad. RISC System/6000 and OS/2 are trademarks of IBM Corporation.) 
5 The Web browser in the client machine 1 0 may preferably be the IBM Web Explorer software product or equivalent 

software tools such as the NetScape or Mosaic tools. This computer system 1 0 Is bi-directionally coupled with the OS^ 
WARP Connect facility over a line or via a wireless system to the server machine 14. The server machine 14 may pref- 
erably be another IBM PS/2 computer, an IBM RISC System/6000 workstation, or other similar system. 

The program run by the server 14, which incorporates the invention, may be provided to a general purpose com- 
10 puter on a pre-recorded medium, as discussed above, such as a floppy disk 15. in which case the novel and non-obvi- 
ous features of the program code are pre-programmed onto the disk 15 for convenient sales and marketing, and for 
directing operation of the server machine 14 in accordance with the invention. 

FIG. 3 is a high level flowchart showing the operation of the method of the invention. A preferred embodiment of the 
invention employs a client machine supporting a user interface utilizing HTML format, and further employs the World 
15 Wide Web. Much of the functionality of the invention may be carried out at the server. However, the description which 
follows will be readily understandable from the vantage point of the user at the client machine. 

Initially, the user selects a video object (step 20). This may be done in any suitable way. The Web provides numer- 
ous fornrats. techniques, etc.. for selecting objects. In the preferred enrtKxJiment. selection of the video object by tiie 
user causes a request to be sent over the Web from the client machine to the server machine. In particular, a Web page. 
20 provided by the server 14 to the client 10. may include a graphical interface, such as Hypertext links, for allowing the 
user to select an object. A preferred way of facilitating this user selection is by means of a graphical interface such as 
the Web page shown in FIG. 6 (discussed below). The Web page of FIG. 6 may be designed and implemented using 
the known techniques such as those given in the references discussed atxjve. 

In step 22, a first set of representations of selected points within the video data object are displayed, responsive to 
25 the user request. In the preferred embodiment employing tiie World Wide Web. the selection of ttie selected points in 
step 22 takes place at tiie server. A more detailed description of the server activity implementing step 22, for tiiis pre- 
ferred embodiment, is given in FIG. 4. 

Referring to FIG. 4. the server initially receives a user request for a video object (step ^4). The server accesses the 
video object (step 26), and selects a set of temporal points in tiie object (step 28, described in still more detail in FIG. 
30 5). The server then produces representations of the selected temporal points for display (step 30). In a preferred 
embodiment, employing the Web and HTML links, the representations of the temporal points are thumbnail images cor- 
responding witii the temporal points in tiie video object. Finally, tiiis information is provided for display (step 32). prefer- 
ably by transmitting, from the server to the client machine, a display such as a Web page. A preferred inplementation 
is that of the Web page shown in FIG. 6 (discussed below). 
35 Before the display is illustrated and described in detail, a more detailed description will be made of step 28 (select- 
ing temporal points). This may be done in any suitable fashion. A few possiljle ways would be to select temporal points 
at random, or evenly spaced, within the video object. 

However, it is pretended that tiie temporal points be selected at scene cut points in tiie video object. In a preferred 
embodiment of tiie invention, st^ 28 is implemented as shown in more detail in FIG. 5. 
40 Referring to FIG. 5, a scene change measure is computed for adjacent frames (step 34). In a particular preferred 
embodiment, the measure is computed by calculating a normalized correlation between the two frames. (For discussion 
purposes, tiie frames will be referred to as the "image" I and tiie "model" M. without necessarily implying any temporal 
order or sequence between them.) 

It is possible to compute the scene change measure using straight correlation, in which case, each pixel of tiie 
45 image I is multiplied with a corresponding pixel of the model M, and a running sum of the products is accumulated. In 
such a scheme, correspondence might be established by means of row-and-column coordinates or other suitable meth- 
ods. 

However, a particularly preferred approach is as follows: For each pixel of the image I. a neighbourhood of tiie cor- 
responding point in tiie nrKxi^ M is considered. For calculating tiie frame change measure, a point within tiie neighbour- 
so hood of the model M whose value is closest to the value of the point in the image I. preferably closest in terms of tiie 
grey scale value, is multiplied by tiie value of the point in the image I. 

It has been found that using the neighbour point having the nearest value, rather than the exact corresponding 
point, improves response to motion in areas of high frequency image data. The set of neighbour points in tiie Model, for 
a given point in tiie Image, may be established in any suitable manner, such as by identifying tfie Image point according 
55 to a system siich as row-and-column coordinates, and defining tiie Model neighbourhood as the points whose row and 
column coordinates matched tiiose of the Image point, or were either one above or one below the Irrage point coordi- 
nate values. 

Also, a normalized version of correlation is used, in order to reduce sensitivity to changes in illumination between 
images of essentially tiie same scene. 
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A preferred formula, incorporating both the correlation function and the pixel-neighbourhood search described 
above, has been found to work well. Specifically, it is easy to threshold, because it is already normalized. That formula 
is as follows: 



In this expression, r is the scene change measure, and n is the number of active pixels used in the measure calculation. 
Depending on the particular drcumstances or the preference of the user, n can be the total number of pixels in the 
frame, or a subset of the total nunt»er used for the calculation. 

15 Ij is the value (such as the grey scale value) for the i-thy one of the pixels in the image I. Mj is the value for one of 
the pixels in the model M which is to be paired up with Ij for the above-discussed scene change calculation. 

The subscript i, for distinguishing between pixels, is a simplified representation for the purpose of the present dis- 
cussion. Any suitable an^angement for distinguishing between pixels, which would be understood or deemed appropri- 
ate to a particular implementation, nnay be used. For Instance, if the pixels are a rectangular array, then the subscript 

20 might be an ordered pair, such as row and column counts. A polar coordinate system, or other systems which would be 
understood to be suitable for a particular implementation, could also be used. 

Note that, for this purpose. Mj is either the i-thy pixel, or a pixel In the neighbourhood of the I-thy pixel. That is. if. for 
instance, a row-and-column coordinate system were used, and a neighbourhood were defined as given above, then the 
Image point I j = I ^ and the Model point M j = M ^.^^^ ^ might be paired up, if M(r+i) ^ had the closest value to Ire of any 

25 of the Model points in the defined neighbourhood. 

Once the scene change values have been determined for the various pairs of adjacent frames in the video object, 
they are compared with a frame change threshold (step 36 in FIG. 4). As stated above, this expression produces a nor- 
malized scene change value, the value being 0 for two identical frames. It is convenient to work with a scene change 
threshold having a value between 0 and 1 . However, any suitable measure of the threshold may be employed. 

30 Then, a test is made (step 38) to determine whether the value is greater than the threshold (alternatively, greater 
than or equal to the threshold; whichever is considered suitable according to the particular implementation). The result 
of this test determines whether the two frames are treated as a scene change (steps 40. 42). 

A preferred graphical implementation of the user interface is shown In FIGs. 6 and 7. In a preferred emtxxjiment. 
to be used in an environment such as the WorW Wide Web. the graphical interfaces are implemented using HTML. 

35 Details of this implementation will be omitted, since they would be known to persons skilled in the fields of HTML and 
other graphical user interfaces. 

Referring first to FIG. 6. a selection menu is provided for the user. The menu lists video objects which are available. 
The listing may Include thumbnail images 46 which illustrate the content of the video objects, text captions 48 naming 
or describing the video objects, or other suitable descriptors. Also, the menu may Include other information which may 

40 be of use to the user, such as the characteristics of the stored file. For instance. FIG. 6 shows a legend "Energy 1 mpeg 
file" 50. which informs the user that the video data object contains image information which is corrpressed according 
to the MPEG data compression standard. The listed video objects are preferably provided in HTML format, so that the 
user can select a video object (step 20 of FIG. 3) by a suitable means such as nrrause-dicklng on the thumbnail image 
or other identifier. 

45 Responsive to the user command, the server 1 4 obtains the video object and. as per FIG. 4. responds to the client 
machine 10. In step 44 of FIG. 3. a user interface is provided for viewing and browsing the video object. A preferred 
embodiment, for use on the Web. is illustrated in FIG. 7. 

FIG. 7 shows a graphical Image that would be displayed on the client machine for the user's viewing. In accordance 
with the Invention, representations of several temporal points in the video object are shown. Preferably, these represen- 

50 tations are shown as thun^nail inrrages 52. For ease of comprehension, the thumbnail images 52 are preferably shown 
in temporal order, in a user-intuitive arrangement such as from left to right, forming a "storyboard" that lets the user scan 
his/her eyes from left to right, and get a sense of the sequence of images or events in the video object. 

To further assist the user in getting a sense of where the displayed thumbnail images occur in the video object, a 
representation of the video object in its entirety, and of the portion covered by the displayed thumbnail images. Is given. 

>5 In FIG. 7. this is shown as a slider bar 54. The slider bar 54 Is shown horizontally, but can be in any suitable configura- 
tion, preferably a configuration which comports well with user intuition. The entire horizontal length of the slider bar 54 
represents the total duration of the video object, and a marker 56 represents the relative location, within the video 
object, of the displayed thumbnail images. Preferably, the marker 56 has a width, relative to the width off the entire slider 
bar 54. which reflects the portion of the entire video data object covered by all of the displayed refxesentatlons. 
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The marker 56 may be moved using the technique, familiar to users of slider bars in conventional GUI window appli- 
cations, of moving a cursor onto the marker 56 with a mouse, holding the mouse button down, and dragging and drop- 
ping the marker 56 by moving the mouse. Alternatively, a GUI control panel 55 may be provided, having buttons such 
as the left, right, fast left, and fast right button shown. By positioning the cursor on one of these buttons, or other suitable 

5 control buttons, and pressing a mouse button, the user causes the marker 56 to scroll along the slider bar 54. 

Regardless of what particular technique the user uses to move the marker 56 along the slider bar 54. representa- 
tions of points within the video data object appear and disappear to represent the key frames in whichever part of the 
video data object the marker 56 is covering at that moment. Thus, by moving the marker 56. the user selects a portion 
of the video data object which he or she wishes to view in more detail. 

10 Also in accordance with the invention, identifying indicia such as consecutive numbers may be assigned to the key 
frames, to assist the user in keeping track of which key frames have been examined, or where, within the video data 
object as a whole, a given image appears. This information nnay be provided in the user interface. A shot count 57 dis- 
plays the indicia, in the form of consecutive numbers. For instance, in FIG. 7, the marker 56 is at the left end of the slider 
bar 54. Let us say, then, that the first six thumbnails in the video data object being viewed are displayed. Accordingly, 

15 the shot count 57 displays a value of 1. indicating that the left-most thunrtonail image represents the first thumbnail (i.e., 
the earliest temporal point) of the video data object. If a user were then to use the controls 55 or the marker 56 to move 
through the video data object, then, as the marker 56 moved to the right and different thumbnails 52 appeared for later 
points in the video data object, the shot count 57 would likewise ascend in value. As a result, a user might identify a shot 
of particular interest, and remember it for easy future access by taking note of the value of the shot count. 

20 Step 22 of FIG. 3 is illustrated, in a preferred embodiment, by the thumbnail images 52 and the slider bar 54, dis- 
cussed above. 

Also in accordance with the invention, the user interface includes means for selecting an interval within the data 
objects. The interval is selected in terms of the displayed representations. Preferably, the user selects one of the repre- 
sentations, and the interval is selected in terms of an interval between the selected representation and another one of 
25 the representations, such as the temporally subsequent one of the representatior^. The method step 44 of FIG. 3, in 
which a user interface is provided for facilitating viewing and browsing, is implemented by. providing the means for 
selecting. 

In accordance with the invention, the user interface provided in step 44 includes means for selecting an interval 
between two of the key frames. Preferably the means for selecting an interval operates in conjunction with the graphical 
30 user interface GUI of the system according to the invention. That is, a user uses a GUI input device, such as a mouse, 
to select a representation of a desired interval from several representations of intervals shown on the display. 
A preferred way of performing step 44 is shown in FIG. 8. 

Referring to FIG. 8, a user is initially presented with a set of representations of frames of the video object, such as 
the sequence 52 of thumbnails shown in FIG. 7. The user views the thumbnails and decides that a portion of the video 

35 Object which he/she would like to examine in more detail falls between two of the thumbnails. The user selects the inter- 
val (step 58) through the graphical user interface. In the preferred embodiment of FIG. 7, for instance, the user selects 
one of the thumbnail images. The thumbnail image is inrplemented as a hypertext link, and the user's selection of the 
thumbnail indicates that he/she is interested in the interval between that thumbnail and an adjacent thumbnail (prefer- 
ably the thumtjnail which follows the selected thunr^nail, temporally). 

40 The user*s selection of the thumbnail is processed in known manner as a hypertext selection, and a message is 
sent to the server. The server examines the portion of the video object between the selected frame and the next frame. 
In a manner similar to that of step 28 in FIG. 4, the server selects a new set of key frames falling between the selected 
frame and the next frame (step 60). 

The selected new frames are then displayed (step 62). essentially in the same manner as before (steps 30 and 32 

45 of FIG. 4). Also, the slider bar 56 reflects the selection of the interval by moving and/or changing in width. 

In many cases, the user will want to perform several iterations of this process, in order to narrow down to the par- 
ticular small area of interest within the video object. Accordingly, a repetition loop is shown in FIG. 8 (step 64 and the 
loop back to step 58). 

50 Claims 

1 . A method for facilitating user examination and browsing of video data object information pertaining to a stored video 
data object (Object), the Object including a beginning point, an end point, and points therebetween, the method 
comprising the steps of: 

55 

providing (30), to a user interface, a sequence of representations, each represerrtation corresponding with one 
of the points in the Object; 

a user utilizing a user interface to select (60) an interval of the Object in ternris of a first one of the points; and 
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responsive to user selection of such an interval, providing (62) a subsequence of representations, each repre- 
sentation of the subsequence corresponding with a respective point in the Object, each of the respective points 
falling within the selected interval. 

2. A method as claimed in claim 1 , wherein the step off providing (30) a sequence of representations includes com- 
municating, through a network communication medium (7). with a video repository (14) having video data objects 
including the Object, stored therein. 

3. A method as daimed in claim 1 . wherein the step of providing a sequence of representations includes displaying 
(32) still images representative of content of the Object at respective ones of ttie points in the Object 

4. A method as claimed in claim 1 , wherein the step of providing a sequerrce of representations includes: 
generating tiiumbnail images based on content of the Object at respective ones of the points in the Object; and 

providing thumbnail images representing still frames of the video object at the respective points therein. 

5. A metiiod as claimed in claim 1 , wherein the step of providing a user interface includes: 

displaying a representation of the Object in its entirety; and 

displaying a representation of a portion of the Object covered by the displayed representations of points in the 
Object. 
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6. A metiiod as claimed in claim 5, wherein: 
the step of displaying the representation in its entirety includes displaying a slider bar (54); and 
the step of displaying a representation of a portion of the Object covered by tiie displayed representations 

of points in tfie Object includes displaying a marker (56) on tiie slider bar (54). 

7. A method as claimed in claim 6, wherein: 
the step of selecting (60) an interval includes moving tfie marker (56) along the slider bar (54) from a first 

position thereon to a second position thereon; and 

tiie step of displaying the sequence of representations (52) includes the step (62). responsive to movement 
of the marker along ttie slider, of displaying representations of points within the Object corresponding with the sec- 
ond position of the marker, in place of previously displayed representations of points wittiin the Object con-espond- 
35 ing with ttie first position of the ntarker. 

8. A method as claimed in daim 1 , wherein tiie step of selecting an inten/al indudes selecting one of the representa- 
tions of still images which forms a boundary of the interval to be selected. 

40 9. A metiiod as claimed in claim 8, wherein ttie step of selecting an interval further includes selecting one of ttie rep- 
resentations of still images which forms a first boundary of the interval to be selected, a next one of ttie represen- 
tations forming a second boundary of the interval. 
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55 



1 0. A system for fadlitating user examination and browsing of video data object information pertaining to a stored video 
data object (Object), the Object induding a beginning point, an end point, and points ttierebetween, ttie system 
comprising: 

a display unit; 

an interface for coupling the display unit witii a network communication medium (7). ttie medium being coupled 
to a video repository (14) having video data objects, including the Object, stored therein; 

means for sending, through ttie interface and over the medium, a request to view the Object; 

means, responsive to receipt of a response to ttie request to view, for displaying a sequence of representations 
(52). each representation conesponding with one of the points in ttie Object; 

means (56) for selecting an interval in terms of a first one of the points; and 
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means, operable responsive to operation of the means for selecting an interval, for providing a sul>sequence 
of representations, each representation of the subsequence corresponding with a respective poirrt in the 
Object, each of the respective points falling within the selected interval. 

5 11. A system as claimedlh claim 10. wherein: 

the network communication medium (7) includes the Internet; 

the interface includes an Internet interface; and 

the video repository (14) includes a server on the Internet. 

10 12. A system as claimed in claim 10, wherein the beginning point, the end point, and the points are temporal points 
within the Object. 
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